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<110> APPLICANT: Dunn-Coleman, Nigel 
Goedegebuur , Fr i t s 
Ward, Michael 
Yao, Jian 

<120> TITLE OF INVENTION: EGVIII Endoglucanase and Nucleic Acids 
Encoding the Same 
FILE REFERENCE: GC700 

CURRENT APPLICATION NUMBER: US 10/028, 245A 
CURRENT FILING DATE: 2001-12-18 
NUMBER OF SEQ ID NOS : 5 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO: 1 
LENGTH: 1826 
TYPE: DNA 

ORGANISM: Trichoderma reesei 
SEQUENCE: 1 



<130> 
<140> 
<141> 
<160> 
<170> 
<210> 
<211> 
<212> 
<213> 
<400> 
gtcgacccac 
cccccatcac 
taacacacac 
tagcactttg 
tttgttcgac 
acgtacaatt 
tgtgcgacgc 
cttggccgtg 
cggaatcgac 
cctgctgagc 
cggcctcaac 



gcgtccgttc 
cgtcaccact 
tcgtttctgt 
tttcgttctt 
taggtagtgg 
aatacaccat 
tcctcgtttc 
gctggcgatg 
tttggctgcg 
tacaaaggag 
gtctttcgca 



attcttcctc 
ctcctcattg 
tactctcgct 
cgttctcttt 
taatatacgg 
ctcgttaatc 
tccctctcat 



ccctcctcct 
ccgctctctc 
gtcgtcggct 
taatccgtca 
acagcttttt 
ggatatatcc 
tatgcgcgca 



cggcaagctg gacgagctca 



ccctcgccgg caagatcaaa 
acatcgacgg cagctgtccg 
gagatggcgc cggccagatg 
tatccgctac atggcagttt 
actggggctc ctacaacaag 



cctcctcctc 
tgcgagccat 
ctgctcgttg 
tcttctgcaa 
ttccctcgct 
ctcggcctct 
acctcccttc 
tatctgggcg 
actgacacgt 
aagcatttcg 
gtcctcaaca 
gtcgtcaacg 



ctccccttct 
gacgcagcat 
gcattctgct 
tctgctgcca 
caacacgtcg 
tcctggtgct 
tggccgccgc 
tcgccattcc 
cgtctgtgcc 
ccgaagacga 
acacggtgga 
cctgtctcga 



39 gacgggcgcc tactgcatga ttgacatgca caactttgcc cgctacaacg gcggcatcat 
cggccaggga ggcgtgtcgg acgacatctt tgtcgacctc tgggtccaga tcgcaaagta 
ctacgaggac aacgacaaga tcatctttgg cctgatgaac gagccgcacg acctcgacat 
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53 



tgagatctgg gcgcagacgt gccaaaaggt 
ctcgcagatg atcctcctgc ccggaaccaa 
tggcagcgcg gaagccctcg gcaagattac 
ctttgatgtc cacaagtatc tcgacatcaa 
agacaacgtc gacgccttca acgacttcgc 
catcatctcc gaaacgggcg cgtccatgga 
gaacaaggcc attagcgaaa acagcgacgt 

tcttgactct 



cagctttgac acgtcgtaca 
cgacaacaag ctcatgaacg agtgcattct 
tccaacaccc acctcaattt ccacagcggc 
tgacggcgac gcgccatcca ctacgaagcc catctttagg gaagaaaccg 
tcccaatgct gttaccaagc cctcgcccga cacgagcgac tcttccgacg 



cgtcactgcg atccgaaagg ccggcgccac 
ctttgccagc gtcgagacgt atgtgtccac 
gaacccggat ggaagcaccg atttgctgta 
caactccggg tcgcacgccg agtgcaccac 
ggactggctg aggcagaaca 
accttcgtgc atgactgcct 
ctacattggc tttgtgggct 
gactcccctc ggcaagcccg 
ggaccagttt accctcgacg 
ggaagagacg gccacggcga 



agcgccaggc 
tctgcgccca 
ggggtgccgg 
gcaactacac 
aaaagtaccg 
cagcaacctc 
cctctcccac 
acgacaagga 
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54 ctcggcagca tctatgagtg cccagggctt gacaggcacg gtgctgttta ctgttgctgc 

55 ccttggctac atgctggtag cgttttgatg tttttttttt aatgagtttg tatacctaat 

56 gagcatgatt gagatgctac gtagtatata tgtctttacg ggtacataag actagagcca 

57 tgttgtaatc aaaaaaaaaa aaaaaa 

59 <210> SEQ ID NO: 2 

60 <211> LENGTH: 419 

61 <212> TYPE: PRT 

62 <213> ORGANISM: Trichoderma reesei 

64 <400> SEQUENCE: 2 

65 Gly Lys lie Lys Tyr Leu Gly Val Ala lie Pro Gly lie Asp Phe Gly 

66 1 5 10 15 

67 Cys Asp lie Asp Gly Ser Cys Pro Thr Asp Thr Ser Ser Val Pro Leu 

68 20 25 30 

69 Leu Ser Tyr Lys Gly Gly Asp Gly Ala Gly Gin Met Lys His Phe Ala 

70 35 40 45 

71 Glu Asp Asp Gly Leu Asn Val Phe Arg lie Ser Ala Thr Trp Gin Phe 

72 50 55 60 

73 Val Leu Asn Asn Thr Val Asp Gly Lys Leu Asp Glu Leu Asn Trp Gly 

74 65 70 75 ' 80 

75 Ser Tyr Asn Lys Val Val Asn Ala Cys Leu Glu Thr Gly Ala Tyr Cys 

76 85 90 95 

77 Met He Asp Met His Asn Phe Ala Arg Tyr Asn Gly Gly He He Gly 

78 100 105 110 

7 9 Gin Gly Gly Val Ser Asp Asp He Phe Val Asp Leu Trp Val Gin He 

80 115 120 125 

81 Ala Lys Tyr Tyr Glu Asp Asn Asp Lys He He Phe Gly Leu Met Asn 

82 130 135 140 

83 Glu Pro His Asp Leu Asp He Glu lie Trp Ala Gin Thr Cys Gin Lys 

84 145 150 155 160 

85 Val Val Thr Ala He Arg Lys Ala Gly Ala Thr Ser Gin Met He Leu 

86 165 170 175 

87 Leu Pro Gly Thr Asn Phe Ala Ser Val Glu Thr Tyr Val Ser Thr Gly 

88 180 185 190 

89 Ser Ala Glu Ala Leu Gly Lys He Thr Asn Pro Asp Gly Ser Thr Asp 

90 195 200 ^ 205 

91 Leu Leu Tyr Phe Asp Val His Lys Tyr Leu Asp lie Asn Asn Ser Gly 

92 210 215 220 

93 Ser His Ala Glu Cys Thr Thr Asp Asn Val Asp Ala Phe Asn Asp Phe 

94 225 230 235 240 

95 Ala Asp Trp Leu Arg Gin Asn Lys Arg Gin Ala He He Ser Glu Thr 

96 245 250 255 

97 Gly Ala Ser Met Glu Pro Ser Cys Met Thr Ala Phe Cys Ala Gin Asn 

98 260 265 270 

99 Lys Ala He Ser Glu Asn Ser Asp Val Tyr lie Gly Phe Val Gly Trp 

100 275 280 285 

101 Gly Ala Gly Ser Phe Asp Thr Ser Tyr He Leu Thr Leu Thr Pro Leu 

102 290 295 300 

103 Gly Lys Pro Gly Asn Tyr Thr Asp Asn Lys Leu Met Asn Glu Cys He 

104 305 310 * 315 ' 320 



1680 
1740 
1800 
1826 



file://C:\CRF4\Outhold\VsrJ028245A.htm 



5/23/05 



RAW SEQUENCE LISTING DATE : 05/23/2005 

PATENT APPLICATION: US/10/028 , 245A TIME: 09:30:02 



Input Set : A:\GC700-SEQLIST2.TXT 

Output Set: N:\CRF4\05232005\J028245A.raw 

105 Leu Asp Gin Phe Thr Leu Asp Glu Lys Tyr Arg Pro Thr Pro Thr Ser 

106 325 330 335 

107 He Ser Thr Ala Ala Glu Glu Thr Ala Thr Ala Thr Ala Thr Ser Asp 

108 340 345 350 

109 Gly Asp Ala Pro Ser Thr Thr Lys Pro He Phe Arg Glu Glu Thr Ala 

110 355 360 365 

111 Ser Pro Thr Pro Asn Ala Val Thr Lys Pro Ser Pro Asp Thr Ser Asp 

112 370 375 380 

113 Ser Ser Asp Asp Asp Lys Asp Ser Ala Ala Ser Met Ser Ala Gin Gly 

114 385 390 395 400 

115 Leu Thr Gly Thr Val Leu Phe Thr Val Ala Ala Leu Gly Tyr Met Leu 

116 405 410 415 

117 Val Ala Phe 

119 <210> SEQ ID NO: 3 

120 <211> LENGTH: 19 

121 <212> TYPE: PRT 

122 <213> ORGANISM: Tricho derma reesei 

124 <400> SEQUENCE: 3 . 

125 Met Arg Ala Thr Ser Leu Leu Ala Ala Ala Leu Ala Val Ala Gly Asp 

126 15 10 15 

127 Ala Leu Ala 

129 <210> SEQ ID NO: 4 

130 <211> LENGTH: 1317 

131 <212> TYPE: DNA 

132 <213> ORGANISM: Trichoderma reesei 

134 <4 00> SEQUENCE: 4 

135 atgcgcgcaa cctcccttct ggccgccgcc ttggccgtgg ctggcgatgc cctcgccggc 60 

136 aagatcaaat atctgggcgt cgccattccc ggaatcgact ttggctgcga catcgacggc 120 

137 agctgtccga ctgacacgtc gtctgtgccc ctgctgagct acaaaggagg agatggcgcc 180 

138 ggccagatga agcatttcgc cgaagacgac ggcctcaacg tctttcgcat atccgctaca 240 

139 tggcagtttg tcctcaacaa cacggtggac ggcaagctgg acgagctcaa ctggggctcc 300 

140 tacaacaagg tcgtcaacgc ctgtctcgag acgggcgcct actgcatgat tgacatgcac 360 

141 aactttgccc gctacaacgg cggcatcatc ggccagggag gcgtgtcgga cgacatcttt 420 

142 gtcgacctct gggtccagat cgcaaagtac tacgaggaca acgacaagat catctttggc 480 

143 ctgatgaacg agccgcacga cctcgacatt gagatctggg cgcagacgtg ccaaaaggtc 540 
14 4 gtcactgcga tccgaaaggc cggcgccacc tcgcagatga tcctcctgcc cggaaccaac 600 

145 tttgccagcg tcgagacgta tgtgtccact ggcagcgcgg aagccctcgg caagattacg 660 

146 aacccggatg gaagcaccga tttgctgtac tttgatgtcc acaagtatct cgacatcaac 720 

147 aactccgggt cgcacgccga gtgcaccaca gacaacgtcg acgccttcaa cgacttcgcg 780 

148 gactggctga ggcagaacaa gcgccaggcc atcatctccg aaacgggcgc gtccatggaa 840 
14 9 ccttcgtgca tgactgcctt ctgcgcccag aacaaggcca ttagcgaaaa cagcgacgtc 900 

150 tacattggct ttgtgggctg gggtgccggc agctttgaca cgtcgtacat cttgactctg 960 

151 actcccctcg gcaagcccgg caactacacc gacaacaagc tcatgaacga gtgcattctg 1020 

152 gaccagttta ccctcgacga aaagtaccgt ccaacaccca cctcaatttc cacagcggcg 1080 

153 gaagagacgg ccacggcgac agcaacctct gacggcgacg cgccatccac tacgaagccc 1140 

154 atctttaggg aagaaaccgc ctctcccact cccaatgctg ttaccaagcc ctcgcccgac 1200 

155 acgagcgact cttccgacga cgacaaggac tcggcagcat ctatgagtgc ccagggcttg 1260 

156 acaggcacgg tgctgtttac tgttgctgcc cttggctaca tgctggtagc gttttga 1317 
158 <210> SEQ ID NO: 5 
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<211> LENGTH: 438 
























160 


<212> TYPE: 


PRT 
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<213> ORGANISM: 


Trichoderma 


reesei 
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<4 00> SEQUENCE: 


5 
























164 


Met 


Arg 


Ala 


Thr 


Ser 


Leu 


Leu 


Ala 


Ala 


Ala 


Leu 


Ala 


Val 


Ala 


Glv 
^ j 


Asp 


165 


1 








5 










10 










15 




166 


Ala 


Leu 


Ala 


Glv 


Lvs 


He 


Lvs 


Tvr 


Leu 


Glv 


Val 


Ala 


He 


Pro 


Glv 
\j j. j 


He 


167 








20 










25 










30 






168 


Asp 


Phe 


Glv 


Cys 


Asp 


He 


Asp 


Gly 


Ser 


Cys 


Pro 


Thr 


Asp 


Thr 


Ser 


Ser 


169 






35 










40 










45 








170 


Val 


Pro 


Leu 


Leu 


Ser 


Tvr 


Lys 


Glv 


Glv 


Asp 


Glv 


Ala 


Glv 


Gin 


Met 


Lys 


171 




50 










55 










60 










172 


His 


Phe 


Ala 


Glu 


Asp 


Asp 


Glv 


Leu 


Asn 


Val 


Phe 


Arg 


He 


Ser 


Ala 


Thr 


173 


65 










70 










75 










80 


174 




Gin 


Phe 


Val 


Leu 


Asn 


Asn 


Thr 


Val 


Asp 


Glv 
j 


Lys 


Leu 


Asp 


Glu 


Leu 


175 










85 










90 










95 




176 


Asn 


T rn 
i j.p 


Gly 


Ser 


Tvr 
j 


Asn 


Lys 


Val 


Val 


Asn 


Ala 


Cys 


Leu 


Glu 


Thr 


Glv 


177 








100 










105 










110 






178 






Pwc 


Mpf 
nc l. 


He 


Asp 


Met 


His 


Asn 


Phe 


Ala 


Arg 


Tvr 


Asn 


Glv 


Glv 


179 






115 










120 










125 








180 


Tip 


J- JLC 


ftl V 


ftl n 


Gly 


Gly 


Val 


Ser 


Asp 


Asp 


He 


Phe 


Val 


Asp 


Leu 


Tro 

x ip 


181 




130 










135 










140 










182 

-L U Z. 


Val 


G'l n 

Vs3J.il 


J- J. c 


Al a 
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He 
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Gly 
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150 










155 










160 


184 
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O J- u 


Pro 


His 


Asp 


Leu 
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He 


Glu 


He 
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Gin 


Thr 


185 










165 










170 










175 




186 


Cys 


Gin 


Lys 


Val 


Val 


Thr 


Ala 


He 


Arg 


Lys 


Ala 


Gly 


Ala 


Thr 


Ser 


Gin 


187 








180 










185 










190 






188 


Met 


lie 


Leu 


Leu 


Pro 


Glv 


Thr 


Asn 


Phe 


Ala 


Ser 


Val 


Glu 


Thr 


Tvr 


Val 


189 






195 










200 










205 








190 


Ser 


Thr 


Glv 


Ser 


Ala 


Glu 


Ala 


Leu 


Glv 


Lys 


He 


Thr 


Asn 


Pro 


Asp 


Glv 


191 




210 










215 










220 










192 


Ser 


Thr 


Asp 


Leu 


Leu 


Tvr 


Phe 


Asp 


Val 


His 


Lys 


Tvr 


Leu 


Asp 


He 


Asn 


193 


225 










230 










235 










240 


194 


Asn 


Ser 


Gly 


Ser 


His 


Ala 


Glu 


Cys 


Thr 


Thr 


Asp 


Asn 


Val 


Asp 


Ala 


Phe 


195 










245 










250 










255 




196 


Asri 


Asp 


Phe 


Ala 


Asp 


Trp 


Leu 


Arg 


Gin 


Asn 


Lys 


Ara 


Gin 


Ala 


He 


lie 


197 








260 










265 










270 






198 


Ser 


Glu 


Thr 


Gly 


Ala 


Ser 


Met 


Glu 


Pro 


Ser 


Cys 


Met 


Thr 


Ala 


Phe 


Cys 


199 






275 










280 










285 








200 


Ala 


Gin 


Asn 


Lys 


Ala 


He 


Ser 


Glu 


Asn 


Ser 


Asp 


Val 


Tyr 


He 


Gly 


Phe 


201 




290 










295 










300 










202 


Val 


Gly 


Trp 


Gly 


Ala 


Gly 


Ser 


Phe 


Asp 


Thr 


Ser 


Tyr 


He 


Leu 


Thr 


Leu 


203 


305 










310 










315 










320 


204 


Thr 


Pro 


Leu 


Gly 


Lys 


Pro 


Gly 


Asn 


Tyr 


Thr 


Asp 


Asn 


Lys 


Leu 


Met 


Asn 


205 










325 










330 










335 




206 


Glu 


Cys 


He 


Leu 


Asp 


Gin 


Phe 


Thr 


Leu 


Asp 


Glu 


Lys 


Tyr 


Arg 


Pro 


Thr 


207 








340 










345 










350 






208 


Pro 


Thr 


Ser 


He 


Ser 


Thr 


Ala 


Ala 


Glu 


Glu 


Thr 


Ala 


Thr 


Ala 


Thr 


Ala 
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209 



355 



360 



365 



210 Thr Ser Asp Gly Asp Ala Pro Ser Thr Thr Lys Pro lie Phe Arg Glu 

211 370 375 380 

212 Glu Thr Ala Ser Pro Thr Pro Asn Ala Val Thr Lys Pro Ser Pro Asp 

213 385 390 395 400' 

214 Thr Ser Asp Ser Ser Asp Asp Asp Lys Asp Ser Ala Ala Ser Met Ser 

215 405 410 415 

216 Ala Gin Gly Leu Thr Gly Thr Val Leu Phe Thr Val Ala Ala Leu Gly 

217 420 425 430 

218 Tyr Met Leu Val Ala Phe 

219 435 
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